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Abstract 

For a graph G with real weights assigned to the vertices (edges), the MAX iJ-SUBGRAPH 
problem is to find an i7-subgraph of G with maximum total weight, if one exists. Our main 
results are new strongly polynomial algorithms for the MAX i/-SUBGRAPH problem. Some 
of our algorithms are based, in part, on fast matrix multiplication. 

For vertex-weighted graphs with n vertices we solve a more general problem - the all pairs 
MAX i?-SUBGRAPH problem, where the task is to find for every pair of vertices u, v, a maxi- 
mum ff-subgraph containing both u and v, if one exists. We obtain an 0(n*'^'^'^)) time algorithm 
for the all pairs MAX _ff-SUBGRAPH problem in the case where H is a. fixed graph with h 
vertices and lu < 2.376 is the exponent of matrix multiplication. The value of t{uj, h) is de- 
termined by solving a small integer program. In particular, heaviest triangles for all pairs can 
be found in 0(n^+^/'^*^'^^) < o{n^'^^^) time. For h — 4,5,8 the running time of our algorithm 
essentially matches that of the (unweighted) iJ-subgraph detection problem. Using rectangu- 
lar matrix multiplication, the value of 1(0;, h) can be improved; for example, the runtime for 
triangles becomes 0(ri^-^^^). 

We also present improved algorithms for the MAX i?-SUBGRAPH problem in the edge- 
weighted case. In particular, we obtain an 0(m^~^/'^ logn) time algorithm for the heaviest 
cycle of length 2k or 2fc — 1 in a graph with m edges and an 0(n'^/logn) time randomized 
algorithm for finding the heaviest cycle of any fixed length. 

Our methods also yield efficient algorithms for several related problems that are faster than 
any previously existing algorithms. For example, we show how to find chromatic i/-subgraphs 
in edge-colored graphs, and how to compute the most significant bits of the distance product of 
two real matrices, in truly sub-cubic time. 
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1 Introduction 



Finding cliques or other types of subgraphs in a larger graph are classical problems in complexity 
theory and algorithmic combinatorics. Finding a maximum clique is NP-Hard, and also hard to 
approximate jHa98| . This problem is also conjectured to be not fixed parameter tractable |DF95j . 
The problem of finding (induced) subgraphs on k vertices in an n-vertex graph has been studied 
extensively (see, e.g., |AYZ95I IAYZ971 I(:N851 lEHnil IKKMOni lNP85l IPYSTI |YZ04]). All known 
algorithms for finding an induced subgraph on k vertices have running time n^^^\ Many of these 
algorithms use fast matrix multiplication to obtain improved exponents. 

The main contribution of this paper is a set of improved algorithms for finding (induced) /c- vertex 
subgraphs in a real vertex-weighted or edge-weighted graph. More formally, let G be a graph with 
real weights assigned to the vertices (edges). The weight of a subgraph of G is the sum of the weights 
of its vertices (edges). The MAX //-SUBGRAPH problem is to find an //-subgraph of maximum 
weight, if one exists. Some of our algorithms are based, in part, on fast matrix multiplication. 
In several cases, our algorithms use fast rectangular matrix multiplication algorithms. However, 
for simplicity reasons, we express most of our time bounds in terms of lo, the exponent of fast 
square matrix multiplications. The best bound currently available on ui is u < 2.376, obtained by 
Coppersmith and Winograd |CW90j . This is done by reducing each rectangular matrix product to 
a collection of smaller square matrix products. Slightly improved bounds can be obtained by using 
the best available rectangular matrix multiplication algorithms of Coppersmith |Cop97| and Huang 
and Pan |HP98j . In all of our algorithms we assume that the graphs are undirected, for simplicity. 
All of our results are applicable to directed graphs as well. Likewise, all of our results on the MAX 
//'-SUBGRAPH problem hold for the analogous MIN //-SUBGRAPH problem. As usual, we use 
the addition-comparison model for handling real numbers. That is, real numbers are only allowed 
to be compared or added. In particular, our algorithms are strongly polynomial. 

Our first algorithm applies to vertex-weighted graphs. In order to describe its complexity we 
need to define a small (constant size) integer optimization problem. Let /i > 3 be a positive integer. 
The function t{uj,h) is defined by the following optimization program. 

Definition 1.1 

&i = max{6GAr : ^ < [^J}. (1) 

si = h-h + -^. (2) 
A — CO 

.2(&) = max{^-6+L^J , h-{3-u;)[^\}. (3) 

S2 = mm{s2{b) : [l^\<b<h-2}. (4) 

t(LJ, /i) = minjsi, S2}. (5) 
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By using fast rectangular matrix multiplication, an alternative definition for t{uj,h), resulting in 
slightly smaller values, can be obtained (note that if w = 2, as conjectured by many researchers, 
fast rectangular matrix multiplication has no advantage over fast square matrix multiplication). 

Theorem 1.2 Let H be a fixed graph with h vertices. Let G = {V,E) be a graph with n vertices, 
and w : V ^ ^ a weight function. For every pair of vertices u,v € V , an induced H -subgraph 
of G containing u and v of maximum weight (if one exists), can be found in 0{n}^^''^^) time. In 
particular, the MAX H -subgraph problem can be solved in 0(n*('^'^)) time. 

Notice that Theorem 11.21 solves, in fact, a more general problem, the All-Pairs MAX //-Subgraph 
problem. It is easy to establish some small values of t{uj,h) directly. For /i = 3 we have t{uj,3) = 
2 + 1/(4 — uj) < 2.616 by taking 6i = 1 in Using fast rectangular matrix multiplication this 
can be improved to 2.575. In particular, for each pair of vertices, a triangle of maximum weight 
containing them (if one exists) can be found in o(n^'^^^) time. This should be compared to the well- 
known 0{n'^) < o(n^'^^^) time algorithm for detecting a triangle in an unweighted graph |IR,78j . 
For /i = 4 we have t{ui,A) = a; + 1 < 3.376 by taking 6 = 2 in Interestingly, the fastest 
algorithm for detecting a K4, that uses square matrix multiplication, also runs in 0(n'^+i) time 
|NP85j . The same phenomena also occurs for h = 5 where t{u!, 5) = u + 2 < 4.376 and for h = 8 
where t{uj,8) = 2u; + 2 < 6.752. We also note that t(a;,6) = 4 + 2/(4-a;), t(w,7) = 4 + 3/(4 -w), 
t(u;,9) = 2u; + 3 and t(a;, 10) = 6 + 4/(4 — lu). However, a closed formula for t{uj,h) cannot be 
given. Already for h = 11, and for infinitely many values thereafter, t{uj, h) is only piecewise 
linear in u). For example, if 7/3 < lij < 2.376 then t{io, 11) = 3w + 2, and if 2 < lij < 7/3 then 
t{uj, 11) = 6 + 5/(4 — uj). Finally, it is easy to verify that both si in © and S2 in Q converge to 
2>h/ (6 — id) as h increases. Thus, t(a;, h) converges to 3/i/ (6 — w) < 0.828/i as h increases. 

Prior to this work, the only known algorithm for MAX iZ-SUBGRAPH in the vertex-weighted 
case (moreover, the All-Pairs version of the problem) was the naive 0{n^) algorithm. In general, 
reductions to fast matrix multiplication tend to fail miserably in the case of real-weighted graph 
problems. The most prominent example of this is the famous All-Pairs Shortest Paths (APSP) 
problem. Seidel |Sei95j and Galil and Margalit |GM97| developed 0{n^) algorithms for undirected 
unweighted graphs. However, for arbitrary edge weights, the best published algorithm known 
is a recent 0(n^/ log n) by Chan |Ch05j . When the edge weights are integers in [— M, M], the 
problem is solvable in 0(Mn'^) by Shoshan and Zwick |?^Z99| . and 0(M°-6Sin2-575) by Zwick |Zwn2| . 
respectively. Earlier, a series of papers in the 70's and 80's starting with Yuval |Yu76j attempted 
to speed up APSP directly using fast matrix multiplication. Unfortunately, these works require a 
model that allows infinite-precision operations in constant time. 

A slight modification in the algorithm of Theorem II. 2( without increasing its running time by 
more than a logarithmic factor, can also answer the decision problem: "for every pair of vertices 
u,v, is there an //-subgraph containing u and v, whose weight is in the interval [t(;i,ti;2] where 
wi < W2 are two given reals?" Another feature of Theorem 11.21 is that it makes a relatively small 
number of comparisons. For example, a heaviest triangle can be found by the algorithm using only 
0{m + nlogn) comparisons, where m is the number of edges of G. 
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Since Theorem ll.2l is stated for induced //-subgraphs, it obviously also applies to not-necessarily 
induced //-subgraphs. However, the latter problem can, in some cases, be solved faster. For 
example, we show that the o{n?'^^^) time bound for finding a heaviest triangle also holds if one 
searches for a heaviest //-subgraph in the case when H is the complete bipartite graph K2^k- 

Several //-subgraph detection algorithms take advantage of the fact that G may be sparse. 
Improving a result of Itai and Rodeh |IR78j . Alon, Yuster and Zwick obtained an algorithm for 
detecting a triangle, expressed in terms of m |AYZ97j . The running time of their algorithm is 
Q(^^2uj/(uj+i)-^ < o{m^-'^^). This is faster than the ©(n"^) algorithm when m = o(n('^+^)/2). The best 
known running times in terms of m for H = when /c > 4 are given in |EGn4j . Sparseness can 
also be used to obtain faster algorithms for the vertex-weighted MAX //-SUBGRAPH problem. 
We prove: 

Theorem 1.3 If G = (y,E) is a graph with m edges and no isolated vertices, and w : V ^ ^ 
is a weight function, then a triangle of G with maximum weight (if one exists) can be found in 

The proofs of Theorems 11.21 and and some of their consequences, appear in Section 3. 
In Section 2 we first introduce a general method called dominance computation, motivated by a 
problem in computational geometry and introduced by Matousek in jMaOlj . and show how it can 
be used to obtain a truly sub-cubic algorithm for the MAX /Cs-SUBGRAPH problem. Although 
the running time we obtain using this method is slightly inferior to that of Theorem 11.21 we show 
that this method has other very interesting applications. In fact, we will show how to use it in 
order to efficiently solve a general buyer-seller problem from computational economics. Another 
interesting application of the method is the ability to compute the the most significant bits of the 
distance product A-k B oi two real matrices, in truly sub-cubic time (see the definition of distance 
products in the next section) . Computing the distance product quickly has long been considered as 
the key to a truly sub-cubic APSP algorithm, since it is known that the time complexity of APSP 
is no worse than that of the distance product of two arbitrary n x n matrices. 

We now turn to edge-weighted graphs. An 0(m^~-'^/rfc/^T ) time algorithm for detecting the 
existence of a cycle of length k is given in |AYZ97j . A small improvement was obtained later 
in |YZ04j . However, the algorithms in both papers fail when applied to edge- weighted graphs. 
Using the color coding method, together with several additional ideas, we obtain a randomized 
0(rn-^~^/ rfc/2l ^ time algorithm in the edge-weighted case, and an 0(m^~-'^/r^/^T logn) deterministic 
algorithm. 

Theorem 1.4 Let k > 3 be a fixed integer. If G = (y,E) is a graph with m edges and no 
isolated vertices, and w : E ^ ^ is a weight function, then a maximum weight cycle of length k, 
if one exists, can be found with high probability in 0{rn?~^/^^^'^'^) time, and deterministically in 

o(m2-vrfc/2i log 

n) time. 

In a recent result of Chan |Chn5j it is shown that the distance product of two nxn matrices with 
real entries can be computed in 0(n^/logn) time (again, reals are only allowed to be compared 



4 



or added; more recently, Y. Han announced an 0(n'^(log log n/ log n)^/^) time algorithm). We 
show how to reduce the MAX ff-SUBGRAPH problem in edge-weighted graphs to the problem of 
computing a distance product. 

Theorem 1.5 Let H be a fixed graph with h vertices. If G = {V,E) is a graph with n vertices, 
and w : E ^ ^ is a weight function, then an induced H -subgraph of G (if one exists) of maximum 
weight can be found in 0{n^ / logn) time. 

We can strengthen the above result considerably, in the case where H is a cycle. For (not- 
necessarily induced) cycles of fixed length we can combine distance products with the color coding 
method and obtain: 

Theorem 1.6 Let k be a fixed positive integer. If G = {V,E) is a graph with n vertices, and 
w : E ^ ^ is a weight function, a maximum weight cycle with k vertices (if exist) can be found, 
with high probability, in 0{n^ / logn) time. 

In fact, the proof of Theorem 11.61 shows that a maximum weight cycle with k = o(loglogn) ver- 
tices can be found in (randomized) sub-cubic time. Section 4 considers edge-weighted graphs and 
contains the algorithms proving Theorems II. 4( 11.51 and 11.61 

Finally, we consider the related problem of finding a certain chromatic i^-subgraph in an edge- 
colored graph. We consider the two extremal chromatic cases. An if-subgraph of an edge-colored 
graph is called rainbow if all the edges have distinct colors. It is called monochromatic if all the edges 
have the same color. Many combinatorial problems are concerned with the existence of rainbow 
and/or monochromatic subgraphs. 

We obtain a new algorithm that finds a rainbow i7-subgraph, if one exists. 

Theorem 1.7 Let H be a fixed graph with 3k + j vertices, j € {0, 1, 2}. If G = (V, E) is a graph 
with n vertices, and c : E ^ C is an edge- coloring, then a rainbow H -subgraph of G (if one exists) 
can be found in 0{n^^^^ logn) time. 

The running time in Theorem 11.71 matches, up to a logarithmic factor, the running time of the 
induced /f-subgraph detection problem in (uncolored) graphs. 

We obtain a new algorithm that finds a monochromatic -ff-subgraph, if one exists. For fixed 
H, the running time of our algorithm matches the running time of the (uncolored) i?-subgraph 
detection problem, except for the case H = K^. 

Theorem 1.8 Let H be a fixed connected graph with 2>k + j vertices, j G {0, 1,2}. If G = {V,E) is 
a graph with n vertices, and c : E ^ C is an edge- coloring, then a monochromatic H -subgraph of 
G (if one exists) can be found in 0{n'^^~^^) time, unless H = K^. A monochromatic triangle can 
be found in 0(n(3+'^)/2) < ^^^2.fsm^ ^^^^^ 

The algorithms for edge-colored graphs yielding Theorems 11.71 and 11.81 appear in Section 5. The 
final section contains some concluding remarks and open problems. 
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2 Dominance computations 



Given a set of points {vi, . . . , Vn} in M , the dominating pairs problem is to find all pairs of points 
{vi,Vj) such that for all k = 1, d, Vi[k] < Vj[k]. The key insight to our method is a connection 
between the problem of finding triangles and the well-known problem of computing dominating 
pairs in computational geometry. This connection was inspired by recent work of Chan |Ch05j , who 
demonstrated how a 0{c'^n^^'^ + n'^) algorithm for computing dominating pairs in d dimensions can 
be used to solve the arbitrary APSP problem in 0(n^/logn) time. 

In particular, we use an elegant algorithm by Matousek for computing dominating pairs in n 
dimensions |Ma91j . Matousek's algorithm does a bit more than determine dominances — it actually 
computes a matrix D such that 

D[i,j] = \{k I Vi[k] < vj[k]}\. 
We will call D the dominance matrix in the following. 

Theorem 2.1 (Matousek i !Ma91| ^ Given a set S of n points in W^, the dominance matrix for 
S can be computed in O (^n~^^ time. 

We outline Matousek's approach in the following paragraphs. For each coordinate j = 1, . . . ,n, 
sort the n points by coordinate j. This takes O(n^logn) time. Define the jth rank of point Vi, 
denoted as rj{vi), to be the position of Vi in the sorted list for coordinate j. 

For a parameter s € [log n,n], make n/s pairs of Boolean matrices {Ai, Bi) . . . , {An/s, Bn/s) 
defined as follows: 

Ak[i,j] = 1 rj{vi) G [ks,ks + s), 

Bk[i,j] = 1 rj{vi) >ks + s. 

Now, multiply A^ with obtaining a matrix Ck- Then Ck[i,j] equals the number of coordi- 
nates c such that Vi[c\ < fj[c], rdvi) G [ks, ks + s), and rj{vi) >ks + s. 
Therefore, letting 

n/s 

c = ^ Ck-, 

k=l 

we have that C[i,j] is the number of coordinates c such that lrc{vi)/s\ < \jc{vj)/s\. 

Suppose we compute a matrix E such that E[i,j] is the number of c such that Vi[c\ < Vj[c\ and 
[i^c{'Vi)/s\ = [j'c{vj)/s\. Then, defining D := C + E , we have the desired matrix 

D[i,j] = \{k I Vi[k] < vj[k]}\. 

To compute E, we use the n sorted lists. For an integer n, define [n] = {1, . . . ,n}. Then, for 
each pair (i, j) G [n] x [n], we look up fj's position p in the sorted list for coordinate j. By reading 
off the adjacent points less than Vi in this sorted list (i.e. the points at positions p — 1, p — 2, 
etc.), and stopping when we reach a point Vk such that lrj{vk)/s\ < lrj{vi)/s\, we obtain the list 
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Vi^,. . . of £ < s points such that, for all a; = 1, . . . , ^, Vi^[j] < Vi[j] and [rj{vi)/s\ = [rj{vi^)/s\. 
Finally, for each x = 1, ...,£, we add a 1 to E[ix,i]. Assuming constant time lookups and constant 
time probes into a matrix, this entire process takes only O(n^s) time. 

The running time of the above procedure is 0{'n?s + ju^)- Choosing s = n~2~ , the time bound 
becomes O ^n"^ j . 

2.1 Finding a heaviest triangle in sub-cubic time 

We first present a weakly polynomial deterministic algorithm, then a randomized strongly polyno- 
mial algorithm. 

Theorem 2.2 On graphs with integer weights, a maximum vertex-weighted triangle can be found 
in 0(n('^"^^)/^ • logiy) time, where W is the maximum weight of a triangle. On graphs with real 
weights, a maximum vertex-weighted triangle can be found in 0{n^^~^'^^^'^ ■ B) time, where B is the 
maximum number of hits in a weight. 

Proof: The idea is to obtain a procedure that, given a parameter K, returns an edge {i,j) from 
a triangle of weight at least K. Then one can binary search to find the weight of the maximum 
triangle, and try all possible vertices k to get the triangle itself. 

We first explain the binary search. Without loss of generality, we assume that all edge weights 
are at least 1. Let W be the maximum weight of a triangle. Start by checking if there is a triangle 
of weight at least = 1 (if not, there are no triangles). Then try K = 2^ for increasing i, until 
there exists a triangle of weight 2* but no triangle of weight 2*+^. This i will be found in 0(log W) 
steps. After this, we search on the interval [2*, 2*+^) for the largest K such that there is a triangle 
of weight K. This takes O(logM^) time for integer weights, and 0{B) time for real weights with B 
bits of precision. 

We now show how to return an edge from a triangle of weight at least K, for some given K. 
Let y = {1, . . . , n} be the set of vertices. For every i G F, we make a point = (e(l), . . . , e(n)), 
where 

K — w{i) if there is an edge from i to j 
oo otherwise. 



(In implementation, we can of course substitute a sufficiently large value in place of oo.) We also 
make a point gi = (e'(l), . . . , e'(n)), where 



e'(j) 



w{i) -\- w{j) if there is an edge from i to j 
— oo otherwise. 



Compute the dominance matrix D{K) on the sets {/j} and {gi}- For all edges in the graph, 
check if there exists a k such that /j [k] < gj [k] . This can be done by examining the appropriate 
entry in D{K). If such a k exists, then we know there is a vertex k such that 

K — w{i) < w{j) + w{k) K < w{i) + w{k) + w{j), 
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that is, there exists a triangle of weight at least K using edge (i, j). Observe that the above works 
for both directed and undirected graphs. □ 

In the above, the binary search over all possible weights prevents our algorithm from being 
strongly polynomial. We would like to have an algorithm that, in a comparison-based model, has a 
runtime with no dependence on the bit lengths of weights. Here we present a randomized algorithm 
that achieves this. 

Theorem 2.3 On graphs with real weights, a maximum vertex-weighted triangle can be found in 
Q^j^{(^+3)/2 .jogn) expected worst-case time. 

We would like to somehow binary search over the collection of triangles in the graph to find the 
maximum. As this collection is O(n^), we would then have our strongly polynomial bound. Ideally, 
one would like to pick the "median" triangle from a list of all triangles, sorted by weight. But as 
the number of triangles can be Q{n^), forming this list is hopeless. Instead, we shall show how 
dominance computations allow us to efficiently and uniformly sample a triangle at random, whose 
weight is from any prescribed interval (VFi, VF2)- If we pick a triangle at random and measure its 
weight, there is a good chance that this weight is close to the median weight. In fact, a binary 
search that randomly samples for a pivot can be expected to terminate in O(logra) time. 
Let Wi, W2 G M U {—00, 00}, Wi < W2, and G be a vertex-weighted graph. 

Definition 2.4 C(Wi,W2) is defined to be the collection of triangles in G whose total weight falls 
in the range [VFi, ^2]- 

Lemma 2.5 One can sample a triangle uniformly at random from C{Wi, W2), in 0{n^^^^^^'^) time. 

Proof: From the proof of Theorem 12. 21 one can compute a matrix D{K) in 0(n^'^^^-*''^) time, such 
that D{K)[i.,j\ 7^ iff there is a vertex k such that (z, k) and (/c, j) are edges, and w{i) + w{j) + 
w{k) > K. In fact, the i,j entry of D{K) is the number oi distinct vertices k with this property. 

Similarly, one can compute matrices E{K) and L{K) such that E{K)[i,j] and L{K)[i,j] contain 
the number of vertices k such that (i, k) and {k,j) are edges, and w{i) + w{j) + w{k) < K (for E) 
or w{i) + w{j) + w{k) < K (for L). (This can be done by flipping the signs on all coordinates in the 
sets of points {/«} and {gi} from Theorem 12. 21 then computing dominances, disallowing equalities 
for L.) 

Therefore, if we take F = E{W2) — L{Wi)^ then F[i,j] is the number of vertices k where there 
is a path from z to A; to j, and w{i) + w{j) + w{k) e [Wi, VF2]. 

Let / be the sum of all entries F[i,j]. For each G E, choose {i,j) with probability F[i,j]/f. 
By the above, this step uniformly samples an edge from a random triangle. Finally, we look at the 
set of vertices S that are neighbors to both i and j, and pick each vertex in S with probability 
1^. This step uniformly samples a triangle with edge {i,j). The final triangle is therefore chosen 
uniformly at random. □ 

Observe that there is an interesting corollary to the above. 
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Corollary 2.6 In any graph, one can sample a triangle uniformly at random in 0{n^) time. 

Proof: (Sketch) Multiplying the adjacency matrix with itself counts the number of 2-paths from 
each vertex to another vertex. Therefore one can count the number of triangles and sample just as 
in the above. □ 

We are now prepared to give the strongly polynomial algorithm. 
Proof of Theorem 12. 3t Start by choosing a triangle t uniformly at random from all triangles. 
By the corollary, this is done in 0{n^) time. 

Measure the weight W of t. Determine if there is a triangle with weight in the range (W, oo), 
in ©(n^"^"*"^)/^) time. If not, return t. If so, randomly sample a triangle from {W, oo), let W' be its 
weight, and repeat the search with W'. 

It is routine to estimate the runtime of this procedure, but we include it for completeness. Let 
T(n, k) be the expected runtime for an n vertex graph, where k is the number of triangles in the 
current weight range under inspection. In the worst case. 



k-l 

T{n, k)<-Y^ T{n, k - i) + c ■ n('^+3)/2 



i=l 

for some constant c > 1. But this means 

fc-2 



T(n, k-l)< y T(n, k - i) + c ■ 

k — 1 ^-^ 

4=1 



SO 



T{n,k) < [l + ^-iyT{n,k-l) 
+ ( 1 - ^] cn(-+3)/2 



= r(n,A;-l) + ^n('^+3)/2, 
k 

which solves to T{n, k) = 0(n('^+^)/2 log k). □ 



2.2 Most significant bits of a distance product 

Let A and B be two n x n matrices with entries in K U oo. The distance product C = A * i? is an 
nxn matrix with C[i,j] = mink=i...^nA[i,k] + B[k,j]. Clearly, C can be computed in O(n^) time 
in the addition-comparison model. In fact, Kerr |Ke7nj showed that the distance product requires 
il(n^) on a straight-line program using + and min. However, Predman showed in |Fr76j that the 
distance product of two square matrices of order n can be performed in 0(n'^(loglogn/logn)^/'^) 
time. Following a sequence of improvements over Fredman's result, Chan gave an 0(n^/ log n) time 
algorithm for distance products. 
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Computing the distance product quickly has long been considered as the key to a truly sub-cubic 
APSP algorithm, since it is known that the time complexity of APSP is no worse than that of the 
distance product of two arbitrary nx n matrices. Practically all APSP algorithms with runtime of 
the form 0(n") have, at their core, some form of distance product. Therefore, any improvement 
on the complexity of distance product is interesting. 

Here we show that the most significant bits of A-k B can be computed in sub-cubic time, again 
with no exponential dependence on edge weights. In previous work, Zwick |Zw02j shows how to 
compute approximate distance products. Given any e > 0, his algorithm computes distances dij 
such that the difference of dij and the exact value of the distance product entry is at most 0{e) . The 
running time of his algorithm is ■ n'^ logW). Unfortunately, guaranteeing that the distances 

are within e of the right values, does not necessarily give any of the bits of the distances. Our 
strategy is to use dominance matrix computations. 

Proposition 2.7 Let A,B G (ZD {+00, — cxd})"^"". The k most significant bits of all entries in 
A-k B can be determined in 0(2^ ■ n^2~ logn) time, assuming a comparison-based model. 

Proof: For a matrix M, let M[i, :] be the ith row, and M[:, j] be the jth column. For a constant 
K, define the set of vectors 

M^{K) := {{M[i, 1]-K,..., M[i, n] - K) \i = l,...,n}. 

Also, define 

M«(i^) := {(-M[l, i],..., -M[n, i]) \ i = 1, . . . ,n}. 

Now consider the set of vectors S{K) = A^{K) U B^{K). Using a dominance computation on 
S{K), one can obtain the matrix C{K) defined by 



CiK)[i,j] :-- 
Then for any i,j, 



if 3k s.t. Ui[k] < Vj[k],Ui G A^{K),Vj G B^{K) 

1 otherwise 



mm{A[i,k] +B[k,j]} > K C{K)[i,j] = 1. 

k 

Let W be the smallest power of 2 larger than maxjj{^[i, j]} -|- maxjj{i3[i, j]}. Then C(^) gives 
the most significant bit of each entry m. Ak;B. To obtain the second most significant bit, compute 
C(-^) and C(^). The second bit of {A-k B)[i, j] is given by the expression: 

hC{W)[i,j] A i]) V (-C(f )[z, j] A C(f j]). 

In general, to recover the first k bits of {A-k B), one computes C(-) for 0{2^) values of K. In 
particular, to obtain the ^-th bits, compute 



' V \-C{W{l - -^)) A C{W{1 - - 1))]. 

s=0 
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To see this, notice that for a fixed s, if (for any i,j G [n]) 



W{l-^-^,)< mmA[i, k] + B[k,j] < W{1 - 

then the i-th bit of min^ A[i, k] + B[k,j] must be 1, and if the i-th bit of min^ A[i, k] + B[k,j] is 1, 
then there must exist an s with the above property. 

The values for C{W{1 — 2^^)) fo^" even s are needed for computing the {£ — l)-st bits, hence to 
compute the £-tli bits, at most 2^^^ + 2^~^ dominance computations are necessary. To obtain the 
first k bits of the distance product, one needs only 0(2^) dominance product computations. □ 

2.3 Buyer-Seller stable matching 

We show how the "dominance-comparison" ideas can be used to improve the runtime for solving 
a problem arising in computational economics. In this problem, we have a set of buyers and a set 
of sellers. Each buyer has a set of items he wants to purchase, together with a maximum price for 
each item which he is willing to pay for that item. In turn, each seller has a set of items she wishes 
to sell, together with a reserve price for each item which she requires to be met in order for the 
sale to be completed. Formally: 

Definition 2.8 An (n, A;) -Buyer-Seller instance consists of 

• a set C = {1, . . . , k} of commodities ^ 

• an n-tuple of buyers B = where hi = {Bi,pi), s.t. B^ C. C are the commodities 
desired by buyer i, and pi : Bi ^ R"^ is the maximum price function for buyer i 

• an n-tuple of sellers S = {si, . . . , Sn} where Si = {Si,Vi), s.t. Si C. C are the commodities 
owned by seller i, and Vi : Si ^ is the reserve price function for seller i 

A sale transaction for an item / between a seller who owns / and a buyer who wants / can take 
place if the price the buyer is willing to pay is at least the reserve price the seller has for the item. 
Let us imagine that each buyer wants to do business with only one seller, and each seller wants to 
target a single buyer. Then the transaction between a buyer and a seller consists of all the items 
for which the buyer's maximum price meets the seller's reserve price. 

Definition 2.9 Given a buyer {Bi,pi) and a seller {Sj,Vj) the transaction set Cij is defined as 
Cij = {l\ I & Bi n Sj, Pi{l) > Vj{l)}. Denote by C the transaction matrix with entries \Cij\. 
The price of Cij is defined as Pij = ^i^c Pii^)' ^''^^ reserve of Cij is defined as Rij = 
S;eC Denote by P and R respectively the transaction price and reserve matrices with 

entries Pij and Rij. 

^We will use the words "commodities" and "items" interchangeably. 
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Further, we assume that every buyer i has a preference relation on the sellers j which depends 
entirely on Pij,Rij and \Cij\. Conversely, every seller has a preference relation on the buyers 
determined by the same three values. More formally. 



j prefers buyer i to buyer i' iff gj{Pij,Rij, \Cij\) > gj{Pi'j,Riij, \Ci'j\). 

Ideally, each buyer wants to talk to his most preferred seller, and each seller wants to sell to 
her most preferred buyer. Unfortunately, this is not always possible for all buyers, even when the 
prices and reserves are all equal, and all preference functions equal \Cij\. This is evidenced by the 
following example: Buyer 1 wants to buy item 2, buyer 2 wants to buy items 1 and 2, seller 1 has 
item 1, seller 2 has items 1 and 2. Here buyer 1 will not be able to get any items. 

In a realistic setting, we want to find a buyer-seller matching so that there is no pair (6j, Sj) for 
which bi is not paired with Sj, such that both bi and sj would benefit from breaking their matches 
and pairing among each other. This is the stable matching problem, for which optimal algorithms 
are known when the preferences are known {e.g., Gale-Shapley |GS62j can be implemented to run 
in 0(n^)). However, for large k, the major bottleneck in our setting is that of computing the 
preference functions of the buyers and sellers. 

The obvious approach to compute Pij, Rij and \Cij\ is to explicitly find the sets Cij. This gives 
an 0{kv?) algorithm to compute Pij,Rij and \Cij\ for all pairs 

Let for a (computable) function /, Tf be the maximum time, over all re-bit p, r and c, needed 
to compute /(p, r, c). Let T be the maximum time Tj, over all preference functions fi and gj for a 
buyer-seller instance. Then in time 0{kin? + Tre^ + re^ log re) one can obtain for every buyer (seller) 
a list of the sellers (buyers) sorted by the buyer's (seller's) preference function. Exploiting fast 
dominance computation, we can do better. 

Theorem 2.10 The matrices P, R and C for an (n, k) -Buyer-Seller instance can be determined 
in 0{n^kM{n, k)) time, where M{n, k) is the time required to multiply an nx k by a kxn matrix. 

Proof: Using the dominance technique, we can compute matrix C as follows. For each buyer 
i we create a A;-dimensional vector f3i = . . . , /Sik} so that (3ij = Pi{j) if j € Cj, and (3ij = — oo if 
j ^ Cj. For each seller i we create a A;-dimensional vector Uj = {cjji, . . . ,(Tik} so that cjjj = Vi{j) if 
j £ Si, and cjjj = cxd if j ^ 5j . Computing the dominance matrix for these points computes exactly 
the number of items / which buyer i wants to buy, seller j wants to sell, and Pi{i) > Vj{i). 

By a modification of Matousek's algorithm for computing dominances, we can also compute the 
matrices P and R. We demonstrate how to find R. Recall that the dominance algorithm does a 
matrix multiplication Af^ ■ with entries = 1 iff rj{bi) € [ks,ks -\- s), and B]^[i,j] = 1 iff 

i^ji^i) > ks -\- s (using the notation from Theorem 12. 1() . Let Bf. be the same, but redefine to be 



• buyer i has a (computable) preference function fi : x x TL such that i prefers 
seller j to seller j' iff fi{Pij,Rij, \Cij\) > fi{Pij',Rij', IQ/I). 

• Similarly, seller j has a (computable) preference function gj : x R^" x % such that 
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Similar modifications are made to the computation of the matrix E. Instead of adding 1 to the 
matrix entry E[ix,i] in the step for coordinate j, we add the corresponding reserve price Vi^{j). 
Determining P can be done analogously. □ 



Corollary 2.11 A buyer-seller stable matching can be determined in 0{ny/kM{n, k) + logn + 
n^T), where T is the maximum time to compute the preference functions of the buyers /sellers, 



given the buyer price and seller reserve sums for a buyer-seller pair. 

For instance, if A; = n and T = 0(polylog n), the runtime of finding a buyer-seller stable 
matching is 0(n^~) = 0{n^'^^^). 

3 Heaviest iif-subgraphs of real vert ex- weighted graphs 

In the proof of Theorem ll.2l it would be convenient to assume that H = is a clique on h vertices. 
The proof for all other induced subgraphs with h vertices is only slightly more cumbersome, but 
essentially the same. 

Let G = (y,E) be a graph with real vertex weights, and assume V = {1, . . . ,n}. For two 
positive integers a, b, the adjacency system A{G, a, b) is the 0-1 matrix defined as follows. Let Sx 
be the set of all (") x-subsets of vertices. The weight w{U) of C/ G Sx is the sum of the weights of 
its elements. We sort the elements of Sx according to their weights. This requires 0{n^ logn) time, 
assuming x is a constant. Thus, Sx = {Ux^i, . . . ^n^} where w{Ux,i) < w{Ux,i+i)- The matrix 
A{G, a, b) has its rows indexed by Sa- More precisely, the j'th row is indexed by Uaj- The columns 
are indexed by Sh where the j'th column is indexed by C/ftj. We put a, 6)[?7, [/'] = 1 if and 
only if [/ U [/' induces a Ka+h in G (this implies that C/ n t/' = 0), otherwise A{G, a, b)[U, U'] = 0. 
Notice that the construction of A{G, a, b) requires 0{n°'~^^) time. 

For positive integers a, 6, c, so that a + 6 + c = /i, consider the Boolean product A{G, a, 6, c) = 
A{G,a,b) X A{G,b,c). For U Sa and U' £ Sc for which A{G,a,b,c)[U,U'] = 1, define their 
maximum witness 6(U, U') to be the maximal element U" G Sh for which A{G, a, b)[U, U"] = 1 and 
also A{G,b,c)[U",U'] = 1. For each U e Sa and U' G Sc with A{G,a,b,c)[U,U'] = 1 and with 
U UU' inducing a Ka+c, if U" = 6{U,U') then U UU' U U" induces a Kh in G whose weight is the 
largest of all the copies of G that contain U UU' . This follows from the fact that Sb is sorted. 
Thus, by computing the maximum witnesses of all plausible pairs U £ Sa and U' G Sc we can find, 
for each pair of vertices, a Kh in G with maximum weight containing them, if such a Kh exists, or 
else determine that no iT/t-subgraph contains the pair. 

Let A = Anj^xn2 and B = Bn2xns be two 0-1 matrices. The maximum witness matrix of AB is 
the matrix W = Wn-i^xna defined as follows. 




the maximum k s.t. A[i, k] = B[k,j] = 1 if {AB)[i,j] ^ 0, 
otherwise. 



13 



Let /(?T.i, ^2, 71.3) be the time required to compute the maximum witness matrix of the product of 
an rii x n2 matrix by an 722 x matrix. Let /i > 3 be a fixed positive integer. For all possible 
choices of positive integers a, b, c with a + b -\- c = h denote 

f{h,n) = min f{n"',n^,rf). 

a+b+c=h 

Clearly, the time to sort Sh and to construct A{G, a, b) and A{G, b, c) is overwhelmed by 
f {n"" , n'' , n^) . It follows from the above discussion that: 

Lemma 3.1 Let h > 3 be a fixed positive integer and let G = (V, E) be a graph with n vertices, each 
having a real weight. For all pairs of vertices u,v G V , an induced H -subgraph of G containing 
u and V of maximum weight (if one exists), can be found in 0{f{h,n)) time. Furthermore, if 
f{n"',n^,n'^) = f{h,n) then the number of comparisons needed to find a maximum weight is 
0{n^ log n + z{G, a + c)) where z{G, a + c) is the number of Ka+c in- G- 

In fact, if 6 > 2, the number of comparisons in Lemma b.ll can be reduced to only 0(n^+z(G, a+c)). 
Sorting Sh reduces to sorting the sums X + X + . . . + X {X repeated b times) of an n-element set 
of reals X. Fredman showed in |lr76aj that this can be achieved with only 0{n}') comparisons. 

A simple randomized algorithm for computing (not necessarily maximum) witnesses for Boolean 
matrix multiplication, in essentially the same time required to perform the product, is given 
by Seidel |Sei95j . His algorithm was derandomized by Alon and Naor jAN96j . An alternative, 
somewhat slower deterministic algorithm was given by Galil and Margalit |GM93j . However, 
computing the matrix of maximum witnesses seems to be a more difficult problem. Improv- 
ing an earlier algorithm of Bender et al. BFPSSOQ, Kowaluk and Lingas |KL05j show that 
/(3,n) = 0(n2+i/(4-'^)) < o(„2.6i6)^ ^pj^^g already yields the case /i = 3 in Theorem O We 
will need to extend and generalize the method from |KL05j in order to obtain upper bounds for 
f{h,n). Our extension will enable us to answer more general queries such as "is there a Kh whose 
weight is within a given weight interval?" 

Proof of Theorem ll.2t Let /i > 3 be a fixed integer. Suppose a, b, c are three positive integers 
with a + b+c = h and suppose that < /x < 6 is a real parameter. For two 0-1 matrices A = A^a^^^ 
and B = B^b-^^c the fi-split of A and B is obtained by splitting the columns of A and the rows 
of B into consecutive parts of size \n^~\ or [n^J each. In the sequel we ignore floors and ceilings 
whenever it does not affect the asymptotic nature of our results. This defines a partition of A 
into p = n^~^ rectangular matrices Ai, . . . , Ap, each with rows and n'^ columns, and a partition 
of B into p rectangular matrices Bi, . . . , Bp, each with rows and columns. Let Cj = AiBi 
for i = 1, . . . ,p. Notice that each element of Cj is a nonnegative integer of value at most n'^ and 
that AB = X;r=i Ci. Given the Q, the maximum witness matrix W of the product AB can be 
computed as follows. To determine j] we look for the maximum index r for which Cr[i,j] 7^ 0. 
If no such r exists, then iy[i, j] = 0; otherwise, having found r, we now look for the maximal index 
k so that Ar[i, k] = Ar[k,j] = 1. Having found k we clearly have j] = (r — l)n^ + k. 

We now determine a choice of parameters a, b, c, /i so that the time to compute Ci, . . . ,Cp and 
the time to compute the maximum witnesses matrix W, is 0(n*^'^'^^). By Lemma l3. 11 this suffices 
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in order to prove the theorem. We will only consider ^ < mm{a,b,c}. Taking larger values of 
/i results in worse running times. The rectangular product d can be computed by performing 
0{n"'~^n^~^) products of square matrices of order n'^. Thus, the time required to compute Ci is 

Since there are p such products, and since each of the n'^'^'^ witnesses can be computed in 0{p + n^) 
time, the overall running time is 

Optimizing on fj, we get fi = 6/(4 — w). Thus, if, indeed, 6/(4 — 0;) < min{a, c} then the time needed 
to find W is 0(n''-^+''/(4-^)). Of course, we would like to take 6 as large as possible under these 
constraints. Let, therefore, 6i be the largest integer 6 so that 6/(4 — a;) < [(/i — 6)/2j. For such a 
6i we can take a = [{h — 6i)/2j and c = \{h — 6i)/2] and, indeed, fi < min{a,c}. Thus, (jH)) gives 
that the running time to compute W is 

This justifies si appearing in ((2j in the definition of t(uj,h). There may be cases where we can 
do better, whenever 6/(4 — a;) > min{a,c}. We shall only consider the cases where a = fi = 
l{h — 6)/2j < 6 (other cases result in worse running times). In this case c = \{h — b)/2\ and, using 
(jSJ, the running time is 

0(n'^-(3— )L¥^J +n^-''+LVJ). 

This justifies S2 appearing in @ in the definition of t{uj,h). Since t(uj,h) = min{si,S2} we have 
proved that W can be computed in 0{n^^'^'^^) time. □ 

As can be seen from Lemma l3.1l and the remark following it, the number of comparisons that 
the algorithm performs is relatively small. For example, in the case h = 3 we have a = 6 = c = 1 
and hence the number of comparisons is 0(nlogn + m). In all the three cases h = 4,5,6 the value 
6 = 2 yields t{u!, h). Hence, the number of comparisons is 0{n'^) for /i = 4, 0(n^ + mn) for h = 5 
and 0{n^ + m?) for /i = 6. 

Suppose w : {\,. . . ^ 3? so that w{k.) < w{k + 1). The use of the /i-split in the proof of 
Theorem 11.21 enables us to determine, for each i,j and for a real interval whether or not 

there exists an index k so that A[i, k] = B[k,j] = 1 and w{k) € This is done by performing 

a binary search within the p = vP'^ matrices Ci, . . . ,Cp. The running time in (jB)) only increases 
by a log n factor. We therefore obtain the following corollary. 

Corollary 3.2 Let H he a fixed graph with h vertices, and let I d If G = {V, E) is a graph 
with n vertices, and w : V ^ ^ is a weight function, then, deciding whether G contains an induced 
H -subgraph with total weight in I can be done 0{n^^^'^^ logn) time. 
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Proof of Theorem II. 3t We partition the vertex set V into two parts V = X UY according 
to a parameter A. The vertices in X have degree at most A. The vertices in Y have degree larger 
than A. Notice that \Y\ < 2m/ A. In 0{mA) time we can scan all triangles that contain a vertex 
from X. In particular, we can find a heaviest triangle containing a vertex from X. By Theorem 
11.21 a heaviest triangle induced by Y can be found in 0((m/A)*('^'^)) = 0((m/A)^+^/(^~'^)) time. 
Therefore, a heaviest triangle in G can be found in 



The results in Theorems 11.21 and 11.31 are useful not only for real vertex weights, but also when 
the weights are large integers. Consider, for example, the graph parameter P{G,H), the H edge- 
covering number oi G. We define P{G,H) = if G has no i/-subgraph, otherwise P{G,H) is the 
maximum number of edges incident with an if-subgraph of G. To determine P{G,Kk) we assign 
to each vertex a weight equal to its degree. We now use the algorithm of Theorem 11.21 to find 
the heaviest Kf^. If the weight of the heaviest K/^ is w, then I3{G,K}^) = w — i^- In particular, 
f3{G,KjS) can be computed in 0{n^'^^'^^) time. 

Finally, we note that Theorems 11.21 and 11.31 apply also when the weight of an if-subgraph is 
not necessarily defined as the sum of the weights of its vertices. Suppose that the weight of a 
triangle (x, y, z) is defined by a function /(x, y, z) that is monotone in each variable separately. For 
example, we may consider /(x, y, z) = xyz, f{x, y, z) = xy + xz + yz etc. Assuming that /(x, y, z) 
can be computed in constant time given x, y, z, it is easy to modify Theorems 11.21 and 1 1 . 31 to find a 
triangle whose weight is maximum with respect to / in 0{n^^^^^^~^^) time and 0{m!^'^^~^^^^^^'^~^^'^) 
time, respectively. 

We conclude this section with the following proposition. 

Proposition 3.3 If G = {V, E) is a graph with n vertices, and w : V ^ ^ is a weight function, 
then a (not necessarily induced) maximum weight K2^k-suhgraph can he found in 0(n^+-^/(^~'^)). 

Proof: To find the heaviest K2^k we simply need to find, for any two vertices the k largest 
weighted vertices vi, . . . ,Vk so that each Vi is a common neighbor of i and j. As in Lemma l3. 11 this 
reduces to finding the last k maximum witnesses of a 0-1 matrix product. A simple modification of 
the algorithm in Theorem 11.21 achieves this goal in the same running time (recall that k is fixed). □ 

4 Heaviest iif-subgraphs of real edge-weighted graphs 

Given a vertex-colored graph G with n vertices, an //-subgraph of G is called colorful if each 
vertex of H has a distinct color. The color coding method presented in |AYZ95j is based upon 
two important facts. The first one is that, in many cases, finding a colorful //-subgraph is easier 
than finding an //-subgraph in an uncolored graph. The second one is that in a random vertex 
coloring with k colors, an //-subgraph with k vertices becomes colorful with probability kl/k^ > e~^ 




time. By choosing A = m^^ 3w) ^.j^g result follows. 



□ 
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and, furthermore, there is a derandomization technique that constructs a family of not too many 
colorings, so that each ff-subgraph is colorful in at least one of the colorings. The derandomization 
technique, described in |AYZ95j . constructs a family of colorings of size O(logn) whenever k is 
fixed. It is based upon a construction of k-perfect hash functions given in |FKS84j and in |SS90j . and 
constructions of small probability spaces that admit almost i-wise independent random variables 
|NN90j . The size of the constructed family of colorings is only O(logn) where k is fixed. 

By the color coding method, in order to prove Theorem 11.41 it suffices to prove that, given a 
coloring of the vertices of the graph with k colors, a colorful cycle of length k of maximum weight 
(if one exists) can be found in 0(m^~"'^/l^'^/^^ ) time. 

Proof of Theorem II. 4t Assume that the vertices of G are colored with the colors 1, . . . ,k. 
We first show that for each vertex n, a maximum weight colorful cycle of length k that passes 
through u can be found in 0{m) time. For a permutation vr of 1, . . . , A;, we show that a maximum 
weight cycle of the form u = vi,V2, ■ ■ ■ ,Vk'vo. which the color of Vi is 7r(i) can be found in 0{m) time. 
Without loss of generality, assume vr is the identity. For j = 2, . . . , A; let V^- be the set of vertices 
whose color is j so that there is a path from uto v &Vj colored consecutively by the colors 1, . . . , j. 
Let S{v) be the set of vertices of such a path with maximum possible weight. Denote this weight 
by w{v). Clearly, Vj can be created from Vj-i in 0{ra) time by examining the neighbors of each 
V G Vj-i colored with j. Now, let Wu = Tam.^^.^^ w{v) + w{v,u). Thus, Wu is the maximum weight 
of a cycle passing through u, of the desired form, and a cycle with this weight can be retrieved as 
well. 

We prove the theorem when k is even. The odd case is similar. Let A = m?^^. There are 
at most 2m/A = 0{vn}~'^/^) vertices with degree at least A. For each vertex u with degree at 
least A we find a maximum weight colorful cycle of length k that passes through u. This can 
be done in 0{m?~'^/^) time. It now suffices to find a maximum weight colorful cycle of length k 
in the subgraph G' of G induced by the vertices with maximum degree less than A. Consider a 
permutation vr of 1, . . . , A;. For a pair of vertices x, y, let Si be the set of all paths of length A;/2 
colored consecutively by '/r(l), . . . , 7r(A;/2), 7r(A;/2 + 1). There are at most mA'^/^"-^ = rr?~'^l^ such 
paths and they can be found using the greedy algorithm in 0{rn?'~'^/^) time. Similarly, let 5*2 be 
the set of all paths of length k/2 colored consecutively by i:{k/2 + 1), . . . , 7r(A;), 7r(l). If u,v are 
endpoints of at least one path in Si then let fi{{u,v}) be the maximum weight of such a path. 
Similarly define f2{{u,v}). We can therefore find, in Oivn?^'^^^) a pair u,v (if one exists) so that 
/i({ti,f}) + /2({u,w}) is maximized. By performing this procedure for each permutation, we find 
a maximum weight colorful cycle of length A; in G'. □ 

The definition of distance products, mentioned in the previous section, carries over to rectangu- 
lar matrices. Let A = An^xn2 B = Bn^xns be two matrices with entries in 3?? U oo. In this case, 
the distance product G = A-k B is an ni x matrix with C[i, j] = min^^i ^[h k] + B[k,j]. By 
partitioning the matrices into blocks it is obvious that Chan's algorithm for distance products of 
square matrices can be used to compute the distance product of an ni x n2 matrix and an n2 x 
matrix in 0(nin2n3/ log minjni, n2, ^3}) time. The analogous M^X version of distance products 
(namely, replacing MIN with MAX in the definition, and allowing —00) can be used to solve the 
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MAX If-SUBGRAPH problem in edge weighted graphs. 

Proof of Theorem 11.51 We prove the theorem for H = Kh- The proof for other induced 
//-subgraphs is essentially the same. Partition h into a sum of three positive integers a + b + c = h. 
Let Sa be the set of all i^a-subgraphs of G. Notice that \Sa\ < and that each U £ Sa is an a-set. 
Similarly define Sb and Sc- We define A to be the matrix whose rows are indexed by Sa and whose 
columns are indexed by Sb- The entry A[U, U'] is defined to be — oo if [/ U C/' does not induce a 
Ka+bi otherwise it is defined to be the sum of the weights of the edges induced by C/ U C/'. We 
define B to be the matrix whose rows are indexed by S^ and whose columns are indexed by Sc- 
The entry A\U., U'] is defined to by — oo if C/ U f/' does not induce a Kb+c-, otherwise, it is defined to 
be the sum of the weights of the edges induced by C/ U C/' with at least one endpoint in U' - Notice 
the difference in the definitions of A and B- Let C = A-kB. The time to compute C using Chan's 
algorithm is 0(ra'^/logn). Now, for each U & Sa and U' G Sc so that U U U' induces a Ka+c, let 
'w{U, U') be the sum of the weights of the edges with one endpoint in U and the other in U' plus 
the value of C[C/, C/']. If w{U,U') is finite then it is the weight of the heaviest that contains 
U UU' , otherwise no contains U UU' . □ 

The weighted DENSE fc-SUBGRAPH problem (see, e.g., lEKPOlj ) is to find a k-veitex subgraph 
with maximum total edge weight. A simple modification of the algorithm of Theorem 11.51 solves 
this problem in 0(n'^/ log n) time. To our knowledge, this is the first non-trivial algorithm for this 
problem. Note that the maximum total weight of a ^-subgraph can potentially be much larger than 
a fe-clique's total weight. 

Proof of Theorem 11.61 We use the color coding method, and an idea similar to Lemma 
3.2 in |AYZ95j . Given a coloring of the vertices with k colors, it suffices to show how to find the 
heaviest colorful path of length k — 1 connecting any pair of vertices in 2°('=)n3 / log n time. It will 
be convenient to assume that A; is a power of two, and use recursion. Let Ci be a set oik/2 distinct 
colors, and let C2 be the complementary set of colors. Let Vi be the set of vertices colored by 
colors from Ci for i = 1,2. Let Gi be the subgraph induced by Vi- Recursively find, for each pair 
of vertices in Gi , the maximum weight colorful path of length A;/2 — 1 . We record this information 
in matrices Ai,A2, where the rows and columns of Ai are indexed by Vi- Let B be the matrix 
whose rows are indexed by Vi and whose columns are indexed by V2 where B[u,v] = w{u,v)- The 
max-distance product Dc^,C2 — {Ai-k B)'k A2 gives, for each pair of vertices of G, all heaviest paths 
of length k — \ where the first k/2 vertices are colored by colors from Ci and the last k/2 vertices 
are colored by colors from G2- By considering all (^^2) < possible choices for (Ci,C2), and 
computing Dci,C2 for each choice, we can obtain an n x n matrix D where D[u,v] is the heaviest 
colorful path of length k — 1 between u and v- The number of distance products computed using this 
approach satisfies the recurrence t{k) < 2^t{k/2). Thus, the overall running time is 2'^('')n^/ logn. 

□ 

The proof of Theorem 11.61 shows that, as long as k = o(loglogn), a cycle with k vertices and 
maximum weight can be found, with high probability, in o(n^) time. We note once again that 
all of our algorithms also apply to the MIN version of the problems. The previous best known 
algorithm for finding a minimum weight cycle of length k, in real weighted graphs, has running 



18 



time 0{k\n^2^) |PV91j . 



5 Monochromatic and rainbow i7-subgraphs 

Proof of Theorem II. 7t Assume that H has t edges. The problem of finding a rainbow H- 
subgraph in G can be reduced, at a small cost, to the problem of finding a rainbow //-subgraph 
in another edge-coloring of G where the number of colors used is only t. Assume C is the set of 
colors used in G and consider a function / : G {1, . . . ,t}. This defines a new edge-coloring of 
G. Clearly, if an //-subgraph is not rainbow in the original coloring then it is also not rainbow in 
the new coloring. If / is constructed at random, a rainbow //-subgraph in the original coloring is 
also rainbow in the new coloring with probability > e~*. As in the color coding method, this 
method can be derandomized by constructing O(logm) = O(logn) colorings with only t colors used 
in each of them, so that if H is originally rainbow, it will also be rainbow in one of the constructed 
colorings. 

We may now assume that c : E ^ {1, . . . ,t} and show how to find a rainbow //-subgraph if 
it exists, in 0{n^^^^) time. We shall assume that H = Kh and h = 3k + j where j G {0,1,2}. 
The proof for other types of subgraphs is similar. By our assumption, t = (2) • Consider a 
partition of {1, . . . , i} into 6 parts Ci, C2, C3, C4, C5, Cg. The respective sizes are \Gi\ = \G2\ = (2) , 
IC3I = (''+^'), IC4I = k^, \G5\ = \Ge\ = k{k + j). Notice that there are 2"^^ choices for the partition. 
For each partition we construct two Boolean matrices A and B that are defined as follows. The 
rows of A are indexed by all the rainbow subgraphs of G that use the colors from Ci. The 
columns are indexed by all the rainbow Kp^ subgraphs of G that use the colors from C2. We define 
^[X, y] = lifXny = and the bipartite subgraph induced by the parts X and Y is complete, 
rainbow, and uses the colors from C4, otherwise ^[X, y] = 0. The rows of B are indexed exactly 
in the same order as the columns of A. Namely, by all the rainbow Kk subgraphs of G that use 
the colors from C2. The columns are indexed by all the rainbow /^fc+j subgraphs of G that use 
the colors from C3 . We define B [X, Y] = lifXny = and the bipartite subgraph induced by 
the parts X and Y is complete, rainbow, and uses the colors from C5, otherwise 74[X, y] = 0. The 
Boolean product G = AB can be performed in 0{n^^^^) time using fast matrix multiplication. 
Now, if G[X, y] = 1 and X r\Y = $ and the bipartite subgraph of G induced by the parts X and 
y is complete, rainbow, and uses colors from Gq then we must have that X U y is contained in a 
rainbow subgraph of G. By considering all possible partitions we are guaranteed not to miss a 
single rainbow /^T/j-subgraph of G. □ 

Proof of Theorem II. 8t If // is a star, the theorem is trivial. Next, assume that H is not a 
star and has at least five vertices. Thus, H has two independent edges and at least one additional 
vertex. Put h = 2>k + j and consider a labeling of the vertices of H with {1, . . . , /i} so that the 
following holds. If we partition {1, . . . , /i} into three consecutive parts, as equally as possible, then 
the subgraph of H induced by the first part contains an edge ei and the subgraph induced by the 
second part contains an edge 62. Thus, e.g., if H is the 5-cycle (1,2,3,4,5) a plausible partition is 
{1, 2}{3, 4}{5}, ei = (1,2) and 62 = (3,4). Denote by //i, H2, and //a, the labeled subgraphs of H 
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induced by each of the parts and denote their respective sizes by /ii, /ii, /13. Thus, if j = we must 
have hi = h2 = = k, if j = 1 we can assume hi = k + 1 and h2 = hs = k and if j = 2 we can 
assume hi = h2 = k + 1 and /13 = k. 

We create a Boolean matrix A as follows. The rows of A are indexed by all the ordered /ii-tuples 
of vertices and the columns by all the ordered /12-tuples. We put ^[X, y] = 1 ii X CiY = ^ and 
the mapping that assigns the i'th vertex of X to i and the £'th vertex oi Y to hi + £ corresponds 
to a monochromatic labeled copy of Hi U H2- In particular, note that the edge mapped to ei has 
the same color as the edge mapped to 62- We create a Boolean matrix B as follows. The rows of 
B are indexed exactly like the columns of A. The columns of B are indexed by all the ordered 
/13-tuples. We put y] = lifXny = and the mapping that assigns the i'th vertex of X 
to hi + i and the ^'th vertex ofYtohi + h2 + i corresponds to a monochromatic labeled copy of 
H2 U H3. Let C = AB be the Boolean product. Suppose that C[X, y] = 1 and suppose also that 
X \JY corresponds to a monochromatic labeled copy of Hi U H^. Let Z satisfy A[X, Z] = I and 
B[Z,Y] = 1. Then we must have that X UY L) Z corresponds to a monochromatic copy of H. This 
is because the color of each mapped edge is either that of the edge of G mapped to ei or that of 
the edge of G mapped to 62 but these two are also colored the same. Notice also that if there is 
a monochromatic //-subgraph, it would be captured by our algorithm. Since the time needed to 
compute C is 0{n'^^~^^), the result follows. 

Consider next the case h = A. If H ^ K4 then we can assume that H is labeled by {1,2,3,4} 
so that (1,4) is not an edge and (2,3) is an edge. Thus, the same algorithm described above using 
the partition {1}{2,3}{4} yields an 0{n^'^^) time algorithm for detecting a monochromatic H. 
li H = K4 the algorithm is slightly different. For each v G G, let Si,..., St be a partition of 
the neighbors of v so that x,y & Si if and only if c{v,x) = c{v,y). Searching for a triangle in 
the subgraph induced by Si all of whose edges are colored by a given specific color has the same 
complexity as searching for an uncolored triangle in a graph, and hence can be done in 0(|«S'j|'^) 
time. Thus, in 0{^l^^ \Si\^) < O(n^) we can find a monochromatic triangle containing v, if it 
exists. Performing this procedure for each v & V gives the desired 0{n^^^) time algorithm. 

The only remaining case is H = K3. Let Ei be the set of edges of G colored with i. We say that i 
is heavily used if l-Ei] > n^'^^^^^'^. Clearly, the number of colors heavily used is at most 0{n'^~^'^^^^^'^). 
For each heavily used color i we can decide, in 0{n^) time, whether there is a monochromatic 
triangle colored with i. The overall running time is, therefore 0{n^^'^^^^^^'^^'^) = 0{n^^^^'^/'^). 
For each color i that is not heavily used, we can decide in 0(|-E'jp'^/^'^+"^^) time whether there is a 
monochromatic triangle colored with i using the algorithm from |AYZ97j . The overall running time 
is maximized if \Ei\ = Q{n^^~^^'>/'^) and when there are Q{'n?'~^^^^^/'^) such colors. In this extremal 
case the running time is still only 0(n«'^+i)/2)(2<^/('^+i))+2-{<^+i)/2) ^ q(^{3+u,)/2)^ |-| 

6 Concluding remarks and open problems 

We presented several algorithms for MAX //-SUBGRAPH in both real vertex weighted or real 
edge weighted graphs, and results for the related problem of finding monochromatic or rainbow 
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//-subgraphs in edge-colored graphs. It may be possible to improve upon the running times of 
some of our algorithms. More specifically, we raise the following open problems. 

(i) Can the exponent t{uj,3) in Theorem 11.21 be improved? If so, this would immediately imply an 
improved algorithm for maximum witnesses. 

(ii) Can the logarithmic factor in Theorem 11.41 be eliminated? We know from |AYZ97j that this is 
the case in the unweighted version of the problem. Can the logarithmic factor in Theorem 11.71 be 
eliminated? 

(iii) Can monochromatic triangles be detected faster than the 0{n^^~^^^/'^) algorithm of Theorem 
11.81 .'' In particular, can they be detected in 0{n'^) time? 

(iv) Is the dominance matrix for a set of n points in n dimensions computable in ©(n"^) time? 
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